模型选择过程通常是单一准则决策,在该决策中,我们选择了在特定集合中最大化特定度量的模型,例如验证集的性能。我们声称这非常天真,由于过度搜索现象,可以对过度拟合的模型进行糟糕的选择,从而高估了该特定集合的性能。futhermore,现实世界数据包含模型选择过程不应忽略的噪声,并且在执行模型选择时必须考虑到。此外,我们定义了四个理论最优条件,我们可以追求这些条件,以更好地选择模型并通过使用多标准决策算法(TOPSIS)来分析它们,该算法(TOPSIS)认为代理最佳条件以选择合理的模型。
translated by 谷歌翻译
神经网络体系结构的定义是执行最关键和最具挑战性的任务之一。在本文中,我们提出了平行密码。ParallelMLPS是一种可以通过探索现代CPU和GPU的局部性和并行功能的原理来实现具有不同数量神经元和激活功能的几个独立多层感知神经网络的训练。该技术的核心思想是使用修改的矩阵乘法,该矩阵乘法将序数矩阵乘法替换为两个简单的矩阵操作,这些矩阵操作允许梯度流动的单独且独立的路径,可以在其他情况下使用。我们已经在模拟数据集中评估了我们的算法,该数据集使用10,000种不同的模型来改变样品,功能和批次的数量。如果与顺序方法相比,我们实现了从1到4个数量级的训练速度。
translated by 谷歌翻译
建立强大的确定性神经网络仍然是一个挑战。一方面,某些方法以降低某些情况下的分类准确性为代价改善了分布检测。另一方面,某些方法同时提高了分类准确性,不确定性估计和分布外检测,但以降低推理效率为代价。在本文中,我们提出了使用Dismax损失的培训确定性神经网络,这是对通常的软马克斯损失的倒入替换(即,线性输出层的组合,软磁性激活和交叉透射率损失) 。从Isomax+损失开始,我们根据所有原型的距离创建每个logit,而不仅仅是与正确类关联的logit。我们还引入了一种结合图像的机制,以构建所谓的分数概率正则化。此外,我们提出了一种快速训练后校准网络的方法。最后,我们提出一个复合分数以执行分布外检测。我们的实验表明,Dismax通常在分类准确性,不确定性估计和分布外检测方面同时优于当前方法,同时保持确定性的神经网络推断效率。重现结果的代码可在https://github.com/dlmacedo/distinction-maximization-loss上获得。
translated by 谷歌翻译
目前的分销检测方法通常存在特殊要求(例如,收集异常数据和近似数计验证)并产生副作用(例如,分类精度下降和缓慢/低效推论)。最近,已经提出了熵外检测作为无缝方法(即,避免所有先前提到的缺点的解决方案)。熵外检测解决方案使用ISOMAX损失进行培训和分布外检测的熵分。 ISOMAX损失作为软墨损失的替换(即输出线性层,Softmax激活和跨熵损失的组合)作为替换,因为随着ISOMAX损失的交换软墨损失不需要变化模型的架构或培训程序/超级参数。在本文中,我们执行我们所谓的ISOMAX损失中使用的距离的成像化。此外,我们提出更换最小距离分数的熵分数。实验表明,这些修改显着增加了分布的检测性能,同时保持解决方案无缝。除了竞争或优于所有主要目前的方法外,提出的解决方案除了更容易使用之外,还避免了所有当前限制,因为只需要对培训神经网络的简单损失替代品。用ISOMAX +丢失替换SoftMax丢失并重现结果的代码可在https://github.com/dlmacedo/entropic-out-of-distribution-detection。
translated by 谷歌翻译
Traditionally, data analysis and theory have been viewed as separate disciplines, each feeding into fundamentally different types of models. Modern deep learning technology is beginning to unify these two disciplines and will produce a new class of predictively powerful space weather models that combine the physical insights gained by data and theory. We call on NASA to invest in the research and infrastructure necessary for the heliophysics' community to take advantage of these advances.
translated by 谷歌翻译
This work proposes a framework developed to generalize Critical Heat Flux (CHF) detection classification models using an Unsupervised Image-to-Image (UI2I) translation model. The framework enables a typical classification model that was trained and tested on boiling images from domain A to predict boiling images coming from domain B that was never seen by the classification model. This is done by using the UI2I model to transform the domain B images to look like domain A images that the classification model is familiar with. Although CNN was used as the classification model and Fixed-Point GAN (FP-GAN) was used as the UI2I model, the framework is model agnostic. Meaning, that the framework can generalize any image classification model type, making it applicable to a variety of similar applications and not limited to the boiling crisis detection problem. It also means that the more the UI2I models advance, the better the performance of the framework.
translated by 谷歌翻译
Motivated by mitigating potentially harmful impacts of technologies, the AI community has formulated and accepted mathematical definitions for certain pillars of accountability: e.g. privacy, fairness, and model transparency. Yet, we argue this is fundamentally misguided because these definitions are imperfect, siloed constructions of the human values they hope to proxy, while giving the guise that those values are sufficiently embedded in our technologies. Under popularized methods, tensions arise when practitioners attempt to achieve each pillar of fairness, privacy, and transparency in isolation or simultaneously. In this position paper, we push for redirection. We argue that the AI community needs to consider all the consequences of choosing certain formulations of these pillars -- not just the technical incompatibilities, but also the effects within the context of deployment. We point towards sociotechnical research for frameworks for the latter, but push for broader efforts into implementing these in practice.
translated by 谷歌翻译
When developing deep learning models, we usually decide what task we want to solve then search for a model that generalizes well on the task. An intriguing question would be: what if, instead of fixing the task and searching in the model space, we fix the model and search in the task space? Can we find tasks that the model generalizes on? How do they look, or do they indicate anything? These are the questions we address in this paper. We propose a task discovery framework that automatically finds examples of such tasks via optimizing a generalization-based quantity called agreement score. We demonstrate that one set of images can give rise to many tasks on which neural networks generalize well. These tasks are a reflection of the inductive biases of the learning framework and the statistical patterns present in the data, thus they can make a useful tool for analysing the neural networks and their biases. As an example, we show that the discovered tasks can be used to automatically create adversarial train-test splits which make a model fail at test time, without changing the pixels or labels, but by only selecting how the datapoints should be split between the train and test sets. We end with a discussion on human-interpretability of the discovered tasks.
translated by 谷歌翻译
尽管公平感知的机器学习算法一直在受到越来越多的关注,但重点一直放在集中式的机器学习上,而分散的方法却没有被解散。联合学习是机器学习的一种分散形式,客户使用服务器训练本地模型,以汇总它们以获得共享的全局模型。客户之间的数据异质性是联邦学习的共同特征,这可能会诱导或加剧对由种族或性别等敏感属性定义的无私人群体的歧视。在这项工作中,我们提出了公平命运:一种新颖的公平联合学习算法,旨在实现群体公平,同时通过公平意识的聚合方法维持高效用,该方法通过考虑客户的公平性来计算全球模型。为此,通过使用动量术语来估算公平模型更新来计算全局模型更新,该术语有助于克服嘈杂的非直接梯度的振荡。据我们所知,这是机器学习中的第一种方法,旨在使用公平的动力估算来实现公平性。四个现实世界数据集的实验结果表明,在不同级别的数据异质性下,公平命运显着优于最先进的联邦学习算法。
translated by 谷歌翻译
惯性辅助系统需要连续的运动激发,以表征测量偏差,这些偏差将使本地化框架需要准确的集成。本文建议使用信息性的路径计划来找到最佳的轨迹,以最大程度地减少IMU偏见的不确定性和一种自适应痕迹方法,以指导规划师朝着有助于收敛的轨迹迈进。关键贡献是一种基于高斯工艺(GP)的新型回归方法,以从RRT*计划算法的变体之间实现连续性和可区分性。我们采用应用于GP内核函数的线性操作员不仅推断连续位置轨迹,还推断速度和加速度。线性函数的使用实现了IMU测量给出的速度和加速度约束,以施加在位置GP模型上。模拟和现实世界实验的结果表明,IMU偏差收敛的计划有助于最大程度地减少状态估计框架中的本地化错误。
translated by 谷歌翻译